Scan-to-XML for Vector Graphics : an experimental setup for intelligent browsable document generation
Identifieur interne : 008F43 ( Main/Exploration ); précédent : 008F42; suivant : 008F44Scan-to-XML for Vector Graphics : an experimental setup for intelligent browsable document generation
Auteurs : Bart Lamiroy ; Laurent Najman ; Romain Ehrhard ; Céline Louis ; Franck Quélain ; Nicolas Rouyer ; Nabil ZeghacheSource :
English descriptors
Abstract
This paper describes an experimental setup, conducted in collaboration with the ISA research group of the LORIA laboratory, Océ-PLT, and students from the École des Mines de Nancy. The main objective is to experiment an approach to develop a high level document analysis platform by composing existing bricks from a comprehensive library of state-of-the art algorithms. The test-case of this methodology consists in the realization of a fully automated method of generating a browsable, hyper-linked document from a simple scanned image. We concentrated our work on cutaway diagrams. These documents present the advantage of containing simple browsing semantics, in the sense that they consist of a clearly identifiable legend containing index references, plus a drawing containing one or more occurrences of the same indices. The setup described in this paper starts from a raw binary image of a cutaway diagram, and delivers an XML description matching the references of the legend with the indices in the image, and a browser for interpreting the XML generated map. The complete document treatment pipeline is conceived within a combined scripting and compiled library environment.
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Crin, to step Corpus: 002E15
- to stream Crin, to step Curation: 002E15
- to stream Crin, to step Checkpoint: 001559
- to stream Main, to step Merge: 009464
- to stream Main, to step Curation: 008F43
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" wicri:score="416">Scan-to-XML for Vector Graphics : an experimental setup for intelligent browsable document generation</title>
</titleStmt>
<publicationStmt><idno type="RBID">CRIN:lamiroy01a</idno>
<date when="2001" year="2001">2001</date>
<idno type="wicri:Area/Crin/Corpus">002E15</idno>
<idno type="wicri:Area/Crin/Curation">002E15</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Curation">002E15</idno>
<idno type="wicri:Area/Crin/Checkpoint">001559</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Checkpoint">001559</idno>
<idno type="wicri:Area/Main/Merge">009464</idno>
<idno type="wicri:Area/Main/Curation">008F43</idno>
<idno type="wicri:Area/Main/Exploration">008F43</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Scan-to-XML for Vector Graphics : an experimental setup for intelligent browsable document generation</title>
<author><name sortKey="Lamiroy, Bart" sort="Lamiroy, Bart" uniqKey="Lamiroy B" first="Bart" last="Lamiroy">Bart Lamiroy</name>
</author>
<author><name sortKey="Najman, Laurent" sort="Najman, Laurent" uniqKey="Najman L" first="Laurent" last="Najman">Laurent Najman</name>
</author>
<author><name sortKey="Ehrhard, Romain" sort="Ehrhard, Romain" uniqKey="Ehrhard R" first="Romain" last="Ehrhard">Romain Ehrhard</name>
</author>
<author><name sortKey="Louis, Celine" sort="Louis, Celine" uniqKey="Louis C" first="Céline" last="Louis">Céline Louis</name>
</author>
<author><name sortKey="Quelain, Franck" sort="Quelain, Franck" uniqKey="Quelain F" first="Franck" last="Quélain">Franck Quélain</name>
</author>
<author><name sortKey="Rouyer, Nicolas" sort="Rouyer, Nicolas" uniqKey="Rouyer N" first="Nicolas" last="Rouyer">Nicolas Rouyer</name>
</author>
<author><name sortKey="Zeghache, Nabil" sort="Zeghache, Nabil" uniqKey="Zeghache N" first="Nabil" last="Zeghache">Nabil Zeghache</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>automated generation</term>
<term>component algebra</term>
<term>document analysis</term>
<term>hyperlink</term>
<term>xml</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en" wicri:score="3902">This paper describes an experimental setup, conducted in collaboration with the ISA research group of the LORIA laboratory, Océ-PLT, and students from the École des Mines de Nancy. The main objective is to experiment an approach to develop a high level document analysis platform by composing existing bricks from a comprehensive library of state-of-the art algorithms. The test-case of this methodology consists in the realization of a fully automated method of generating a browsable, hyper-linked document from a simple scanned image. We concentrated our work on cutaway diagrams. These documents present the advantage of containing simple browsing semantics, in the sense that they consist of a clearly identifiable legend containing index references, plus a drawing containing one or more occurrences of the same indices. The setup described in this paper starts from a raw binary image of a cutaway diagram, and delivers an XML description matching the references of the legend with the indices in the image, and a browser for interpreting the XML generated map. The complete document treatment pipeline is conceived within a combined scripting and compiled library environment.</div>
</front>
</TEI>
<affiliations><list></list>
<tree><noCountry><name sortKey="Ehrhard, Romain" sort="Ehrhard, Romain" uniqKey="Ehrhard R" first="Romain" last="Ehrhard">Romain Ehrhard</name>
<name sortKey="Lamiroy, Bart" sort="Lamiroy, Bart" uniqKey="Lamiroy B" first="Bart" last="Lamiroy">Bart Lamiroy</name>
<name sortKey="Louis, Celine" sort="Louis, Celine" uniqKey="Louis C" first="Céline" last="Louis">Céline Louis</name>
<name sortKey="Najman, Laurent" sort="Najman, Laurent" uniqKey="Najman L" first="Laurent" last="Najman">Laurent Najman</name>
<name sortKey="Quelain, Franck" sort="Quelain, Franck" uniqKey="Quelain F" first="Franck" last="Quélain">Franck Quélain</name>
<name sortKey="Rouyer, Nicolas" sort="Rouyer, Nicolas" uniqKey="Rouyer N" first="Nicolas" last="Rouyer">Nicolas Rouyer</name>
<name sortKey="Zeghache, Nabil" sort="Zeghache, Nabil" uniqKey="Zeghache N" first="Nabil" last="Zeghache">Nabil Zeghache</name>
</noCountry>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 008F43 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 008F43 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Main |étape= Exploration |type= RBID |clé= CRIN:lamiroy01a |texte= Scan-to-XML for Vector Graphics : an experimental setup for intelligent browsable document generation }}
This area was generated with Dilib version V0.6.33. |